A Methodology to Identify and Prioritize Gene Candidates for Human Disease

نویسنده

  • Jesus Sainz
چکیده

In a study published this month in Frontiers in Applied Genetic Epidemiology (Zhi et al., 2012), the authors have studied left ventricular hypertrophy (LVH), the thickening of the myocardium of the left ventricle of the heart, which is a trait that can be used as heritable predictor of cardiovascular disease, to identify genomic variants and genes that could be used as predictive markers of increased left ventricular mass. They used next generation sequencing to produce data from the whole exome of a hypertensive population and from total mRNA of a cellular model of LVH. The authors identified 31,426 genomic missense or nonsense mutations in seven African American sibling trios with high familial left ventricular mass indexed to height (LVMH). Using regression analyses, they found out that 295 of these variants, located in 265 genes, were associated significantly to LVMH after adjusting for multiple testing. They also produced total mRNA sequence data from a cellular model of LVH, hypertrophic cardiomyocytes, that was compared to the expression data from control cardiomyocytes producing a list of differentially expressed genes (using as cut off a value of P < 0.05 without adjusting for multiple testing). The LVH differential expression genes were compared with the list of genes with LVMH associated variants, producing 44 genes that were common to both lists. Gene Ontology analysis of the authors 44 genes list indicates a significant enrichment of genes involved in the cell cycle process (Chi test; P = 0.00016, adjusted for multiple testing) and overrepresentation in the cell adhesion process. Pathway analysis indicates that 2 of the 44 common genes (THBS1 and COL6A3) are part of the signaling by Platelet-derived Growth Factor (PDGF) pathway which has been implicated in tissue remodeling, being PDGF a potent stimulator of growth. Data from the Gene Reference into Function database (GeneRIF) in NCBI, indicates that five of these common genes (HLA-B, HTT, THBS1, PAPPA, and SYNE1) have been implicated in the literature with cardiovascular risk, heart disease, or heart failure in human and in mice, and polymorphisms of another (PER3) have been associated to the sympathovagal balance in cardiac control. When the authors adjusted the P-values of the differential expression data for the number of tested genes, they reduced the initial list to 11 genes with differential expression and variants associated to LVMH reaching statistical significance. Pathway analysis indicates that seven of these genes are annotated and belong to pathways such as cell cycle, signaling by PDGF, or regulation of Insulin-like Growth Factor among others. GeneRIF analysis of this new gene list, produced with more stringent criteria for the expression data that reduced to 25% the number of genes to analyze, shows that among the resulting 11 genes still were included 3 out of the 6 genes implicated with heart conditions or heart control (THBS1, PAPPA, and PER3) and one from the signaling by PDGF pathway (THBS1). This further enrichment in genes involved with heart conditions or control is in agreement with the hypothesis of the authors that the differential expression data used is a good criterion to identify heart disease risk genes and, consequently, that the novel cellular model of LVH used to obtain the expression data could be useful to provide functional information of the phenotype. The authors used another approach to select the genes by applying a candidate gene prioritization strategy to the 44 initial genes using seven different criteria which included statistical, linkage, functional, conservation, and allele frequency data among others. This resulted in five genes that satisfied at least three of the seven criteria (HLA-B, HTT, MTSS1, SLC5A12, and THBS1). Interestingly, three of them (HLAB, HTT, and THBS1) are among the genes implicated with heart conditions, being one of them among the 11 with significantly differential expression after adjusting for the number of tests performed. This enrichment of heart condition genes supports that the seven criteria used by the authors are good predictors to identify LVH risk genes. None of the two main technologies used in this approach, RNA sequencing and whole exon-sequencing, is novel in the search for gene candidates to cause disease. However, the combination of whole exome sequencing, gene expression data from a cell model of the trait, and mainly, the gene prioritization strategy using seven additional criteria that includes public annotations and statistical data, provides a novel strategy. In conclusion, the methodology used, despite all the limitations described by the authors in the article, seem to provide an enrichment of genes involved with the trait of study, provide new candidates for further A methodology to identify and prioritize gene candidates for human disease

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction of Human Disease Genes by Human-Mouse Conserved Coexpression Analysis

BACKGROUND Even in the post-genomic era, the identification of candidate genes within loci associated with human genetic diseases is a very demanding task, because the critical region may typically contain hundreds of positional candidates. Since genes implicated in similar phenotypes tend to share very similar expression profiles, high throughput gene expression data may represent a very impor...

متن کامل

Identify, Explain and Prioritize Human Resource Planning Factors in Order to Manifest Organizational Citizenship Behavior by Employees

Purpose: Considering the importance of human resource planning for organizational citizenship behavior, the purpose of this study was identify, explain and prioritize human resource planning factors in order to occurrence organizational citizenship behavior by employees. Methodology: The present research was descriptive from type of qualitative-quantitative. The research population in the qual...

متن کامل

A Methodology to Prioritize the Construction Projects of New Railway Infrastructures for Privatization in Railway Networks (Case Study: Iran)

This study aims to develop a novel methodology to prioritize the construction of new railway infrastructures for privatization. The private sector can cooperate to solve the capacity problems of railway networks, by the construction of new infrastructure. The purpose of this study is to answer the basic question that whether the capacity problems of the railway networks can be solved simply by ...

متن کامل

Guilt by rewiring: gene prioritization through network rewiring in genome wide association studies.

Although Genome Wide Association Studies (GWAS) have identified many susceptibility loci for common diseases, they only explain a small portion of heritability. It is challenging to identify the remaining disease loci because their association signals are likely weak and difficult to identify among millions of candidates. One potentially useful direction to increase statistical power is to inco...

متن کامل

Computational approach to identify deletions or duplications within a gene

Although high-throughput methods exist to identify most small disease causing mutations (e.g. substitutions that alter an amino acid), assays to identify larger classes of mutations such as deletions/duplications are time consuming, laborious and expensive. No in-silico system exists to identify intragene deletion or duplication candidates. We hypothesize that a computational system, SPeeDD (Sy...

متن کامل

Network Analysis of Differential Expression for the Identification of Disease-Causing Genes

Genetic studies (in particular linkage and association studies) identify chromosomal regions involved in a disease or phenotype of interest, but those regions often contain many candidate genes, only a few of which can be followed-up for biological validation. Recently, computational methods to identify (prioritize) the most promising candidates within a region have been proposed, but they are ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2012